Contextualized embeddings for semantic change detection: Lessons learned

Abstract

We present a qualitative analysis of the (potentially erroneous) outputs of contextualized embedding-based methods for detecting diachronic semantic change. First, we introduce an ensemble method outperforming previously described contextualized approaches. This method is used as a basis for an in-depth analysis of the degrees of semantic change predicted for English words across 5 decades. Our findings show that contextualized methods can often predict high change scores for words which are not undergoing any real diachronic semantic shift in the lexicographic sense of the term (or at least the status of these shifts is questionable). Such challenging cases are discussed in detail with examples, and their linguistic categorization is proposed. Our conclusion is that pre-trained contextualized language models are prone to confound changes in lexicographic senses with changes in contextual variance, which naturally stem from their distributional nature, but are different from the types of issues observed in methods based on static embeddings. Additionally, they often merge together syntactic and semantic aspects of lexical entities. We propose a range of possible future solutions to these issues.
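
The abstract does not spell out how a change score is computed, so the following is only an illustrative sketch under stated assumptions, not the authors' implementation. It assumes each word usage has already been turned into a contextualized embedding (here: toy 2-dimensional lists), computes two commonly used scores (distance between averaged "prototype" embeddings, and average pairwise distance across periods), and combines them with a geometric mean. The function names, the toy vectors, and the choice of geometric mean are all hypothetical.

```python
import math
from itertools import product


def cosine_distance(u, v):
    """Return 1 - cosine similarity, clamped at 0 to absorb rounding error."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return max(0.0, 1.0 - dot / (norm_u * norm_v))


def mean_vector(vectors):
    """Average a list of equal-length vectors component-wise."""
    n = len(vectors)
    return [sum(component) / n for component in zip(*vectors)]


def prototype_distance(period_a, period_b):
    """Distance between the averaged usage embeddings of two time periods."""
    return cosine_distance(mean_vector(period_a), mean_vector(period_b))


def average_pairwise_distance(period_a, period_b):
    """Mean cosine distance over all cross-period pairs of usage embeddings."""
    pairs = list(product(period_a, period_b))
    return sum(cosine_distance(u, v) for u, v in pairs) / len(pairs)


def ensemble_score(period_a, period_b):
    """Hypothetical ensemble: geometric mean of the two scores above."""
    prt = prototype_distance(period_a, period_b)
    apd = average_pairwise_distance(period_a, period_b)
    return math.sqrt(prt * apd)


# Toy usage embeddings (assumed data, not taken from the paper).
earlier = [[1.0, 0.0], [0.9, 0.1]]
later_stable = [[1.0, 0.05], [0.95, 0.1]]
later_shifted = [[0.0, 1.0], [0.1, 0.9]]

stable_score = ensemble_score(earlier, later_stable)
shifted_score = ensemble_score(earlier, later_shifted)
print(stable_score < shifted_score)
```

Because the pairwise score grows whenever usages are spread widely across contexts, a sketch like this can assign a high score to a word whose contexts vary without any change of sense, which is the kind of confound the abstract describes.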

Similar resources

Improving Distributional Similarity with Lessons Learned from Word Embeddings

Recent trends suggest that neural-network-inspired word embedding models outperform traditional count-based distributional models on word similarity and analogy detection tasks. We reveal that much of the performance gains of word embeddings are due to certain system design choices and hyperparameter optimizations, rather than the embedding algorithms themselves. Furthermore, we show that these ...

Change management lessons learned for Lean IT implementations

Lean Management is a standard production mode that has been familiar to production organizations for several decades. To date, however, academic literature has presented surprisingly little information about the application of Lean Management in Information Technology (IT) organizations, or what is called Lean IT. Drawing upon an empirical qualitative case study of the IT departments of two mul...

Duqu: Analysis, Detection, and Lessons Learned

In September 2011, a European company sought our help to investigate a security incident that happened in their IT system. During the investigation, we discovered a new malware that was unknown to all mainstream anti-virus products; however, it showed striking similarities to the infamous Stuxnet worm. We named the new malware Duqu, and we carried out its first analysis. Our findings led to the...

Contextualized knowledge repositories for the Semantic Web

We propose Contextualized Knowledge Repository (CKR): an adaptation of the well-studied theories of context for the Semantic Web. A CKR is composed of a set of OWL 2 knowledge bases, which are embedded in a context by a set of qualifying attributes (time, space, topic, etc.) specifying the boundaries within which the knowledge base is assumed to be true. Contexts of a CKR are organized by a hie...

A Contextualized Knowledge Framework for Semantic Web

This thesis focuses on developing a framework for contextualized knowledge representation on the Semantic Web. One of the main concerns of usability for such a framework in the Semantic Web is computational efficiency and implementability. We take an existing framework that is being built in our group [1] and adapt it to RDF(S), for which open source repositories are available. In this proposal, we d...

Journal

Journal title: Northern European Journal of Language Technology

Year: 2022

ISSN: 2000-1533

DOI: https://doi.org/10.3384/nejlt.2000-1533.2022.3478